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Abstract. Intentionally deposited thin films exposed to atmosphere often develop 
unintentionally deposited few monolayer films of surface contamination. This 
contamination arises from the diverse population of volatile organics and inorganics 
in the atmosphere. Such surface contamination can affect the uncertainties in 
*\ determination of thickness, roughness and density of thin film structures by X- 

Ray Reflectometry (XRR). Here we study the effect of a 0.5 nm carbon surface 
contamination layer on thickness determination for a 20 nm titanium nitride thin 
film on silicon. Uncertainties calculated using Markov-Chain Monte Carlo Bayesian 
statistical methods from simulated data of clean and contaminated TiN thin films 
are compared at varying degrees of data quality to study (1) whether synchrotron 
ch sources cope better with contamination than laboratory sources and (2) whether 

cleaning off the surface of thin films prior to XRR measurement is necessary. We 
show that, surprisingly, contributions to uncertainty from surface contamination can 
dominate uncertainty estimates, leading to minimal advantages in using synchrotron- 
over laboratory-intensity data. Further, even prior knowledge of the exact nature of the 
f— ^ surface contamination does not significantly reduce the contamination's contribution 

QO to the uncertainty in the TiN layer thickness. We conclude, then, that effective and 

standardized cleaning protocols are necessary to achieve high levels of accuracy in XRR 

s . measurement. 
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1. Introduction 

X-ray reflectometry is widely used for characterizing the thickness, roughness, and 
density of nanometer scale thin films. Because it uses wavelengths of a similar or smaller 
scale relative to the thicknesses of the layers being studied, the resulting data has a 
relatively direct connection to the structure and therefore has a rather straightforward 
traceability to the International System of Units (SI) [H El |3j. This is a significant 
advantage over other techniques like spectroscopic ellipsometry (SE), whose results 
are somewhat more difficult to interpret. But x-ray reflectometry is still somewhat 
sensitive to surface contamination. A comprehensive study by Seah et al. on ultra- 
thin Si02 layers on Si has shown offsets between x-ray reflectometry (XRR), neutron 
reflectometry (NR), spectroscopic ellipsometry, and x-ray photoelectron spectroscopy 
(XPS) [4J. These offsets are believed likely due to the different effects contamination 
has for each technique. To use x-ray reflectometry for high-accuracy measurements of 
film thickness and roughness - as the National Institute of Standards and Technology 
(NIST) plans to do for thin-film standards - these effects must be quantified. 

The primary difficulty in quantifying the effects of contamination is that thin surface 
contamination layers are hard to measure. The surface contamination with which we 
are concerned typically takes the form of rough, near-monolayer-thicknes, carbonaceous 
compounds. This is challenging to measure with x-ray or neutron reflectivity-based 
methods: carbon's low scattering factor for both techniques and the poor quality of the 
carbon layers produce low-contrast fringes. 

X-ray photoelectron spectroscopy (XPS) and other inelastic scattering techniques 
can be used to measure the quantity per surface area - and thus relative thicknesses 
- of contamination layers, but generally require calibration in order to measure 
absolute thicknesses (3j. This calibration is often, for denser materials, conducted by 
reflectometry measurements. In addition, these experiments are typically conducted in 
ultra- high- vacuum, which may change the properties, or even the very presence, of the 
adsorbed volatile contamination layers. This makes using these experiments somewhat 
challenging for achieving high accuracy in absolute thickness determination. 

But a crucial question is whether these experiments are needed at all: How sensitive 
are other parameters determined by modeling of x-ray reflectometry data to the presence 
of an unknown, hard-to-measure layer of surface contamination? (E.g., how much do you 
have to know about contamination to achieve a given level of uncertainty in thickness 
for other layers within a structure?) And what sort of data quality (i.e., dynamic and 
g-space/ angular range) is needed to achieve desired levels of uncertainties? (E.g., do 
you need synchrotron data?) 

We study these questions by a simulation-based study of the effects of a carbon 
contamination layer on a TiN film on Si substrate structure being considered by NIST 
for an X-ray reflectometry standard. Using a Bayesian statistical approach to XRR 
data analysis, we estimate uncertainties for structural parameters of the high-Z TiN 
layer under various contamination and data quality conditions. 
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2. Background 

2.1. X-ray reflectometry 

X-ray reflectometry can be used to measure the density, thickness, and roughness of 
thin films which are laterally homogeneous at the scale of the beam E] • We limit 
ourselves here to the case of layered structures with fairly sharp interfaces and no density 
grading other than that provided by roughness. Density is measured by the critical angle 
for total external reflection and through the careful analysis of oscillation amplitudes; 
thickness is measured by the period of intensity oscillations appearing after the critical 
angle; and roughness is measured by the rapidity of overall intensity and oscillation 
intensity fall-off. The analysis here uses the Parratt recursion [7] with the perturbation 
for Gaussian roughness described by [8J. 

Analyzing XRR data is an inverse problem: because only the intensity - rather 
than the complex amplitude - of the reflected beam can be measured, in general there 
is no unique structure determined by an XRR pattern. The typical approach is to 
fit the parameters of a multilayer structure using an optimization approach, the most 
common being Genetic Algorithms (GAs) (9J [101 E]. Though optimization finds a 
best-fit solution - and thus a best estimate of the parameter values - it does not 
provide estimates of uncertainties on the parameters. To calculate uncertainties, a 
more sophisticated - and computationally expensive - approach is necessary. NIST has 
developed statistical Markov Chain Monte Carlo (MCMC) methods in order to obtain 
parameter estimations and uncertainties within a Bayesian formalism [12] . 

2.2. Data analysis 

To make inferences about physical structure from XRR data by Genetic Algorithm (GA) 
or Markov Chain Monte Carlo (MCMC) methods requires a physical model that relates 
structural parameters to idealized non-noisy XRR data. This is provided by the Parratt 
recursion with Nevot-Croce roughness described above. 

A physical model is not sufficient, however, because the data collected are (at 
best) noisy and (at worst) corrupted by systematic instrument effects. There must be, 
in addition, a statistical model of the data. For the GA method, this is a % 2 cost 
function; for the MCMC method, a probability density function. The cost function 
and probability density function employed here make very similar assumptions about 
the statistical characteristics of the data, see [T3j. Nonetheless, the GA and MCMC 
methods each recover different types of information from the data. 

X-ray reflectometry data consists of pairs of angles and measured intensities. 
For this study, we assume data free of angular errors with counting-error-limited 
measured intensities. The error in measured intensities is modeled using a log-normal 
likelihood with standard deviation of the square root of the calculated intensities; this 
approximates well a Poisson (i.e., counting-statistic) likelihood [T3] , 

We use a tiered data analysis architecture. XRR model parameters are first 
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Figure 1. Simulated (Cu radiation) X-ray reflectometry (XRR) data for case 2 
structural modal and laboratory quality data (see Tables [2] and [3]) . XRR simulated 
data (plus signs) has been fit using a genetic algorithm refinement (solid line) and 
yielded nearly identical parameters to those used in the simulation (Table [2]). Note 
refinement quality (magnified regions). 



determined for a given structural model using a G A optimization approach |TT] . In this 
analysis, we used a 1000 genome population evolved over 1000 generations to obtain 
best-fit structural parameters. This structural information was then used to initialize 
the starting parameters for a tuned Markov Chain Monte Carlo (MCMC) sampler to 
provide us with probability distributions for each parameter within a given structural 
model. Each MCMC was allowed a 50 000 steps conditioning run to tune the MCMC 
target dimensions. The MCMCs were then run for 250 000 steps to obtain adequate 
statistics for inter-comparison. The tiered analysis was performed several times on each 
data set to validate the refinement stability. 



2.3. Simulated data 

Two structures were simulated for this study: case 1 - a single layer of TiN on an 
Si substrate (for film parameters, see Table [I]) - and case 2 - a contaminated single 
layer of TiN on an Si substrate (see Table |2|) . Simulations were performed under two 
different data quality conditions (see Table [3]) selected to compare parameter refinement 
results between an advanced laboratory instrument and a synchrotron measurements 
from, e.g., a third-generation bending magnet beamline. By way of illustration, we 
present XRR data simulations from our case 2 structure for laboratory (see Figure [I] 
and synchrotron (see Figure [2]) data quality conditions. By considering these two cases, 
we can answer an oft-asked question for XRR data-collection: How many orders of 
magnitude data quality are needed to determine a given XRR model parameter? The 
Bayesian statistical approach used here can directly answer this question by determining 
the 'best-case' theoretically possible from XRR measurements for any refined parameter. 
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Figure 2. Simulated (Cu radiation) X-ray reflectometry (XRR) data for case 2 
structural modal and synchrotron quality data (see Tables [2] and [3]) . XRR simulated 
data (plus signs) has been fit using a genetic algorithm refinement (solid line) and 
yielded nearly identical parameters to those used in the simulation (Table [2]). Note 
refinement quality (magnified regions). 





t/nm 


cr/nm p/g cm 2 


TiN 
Si 


20.0 


0.5 4.90 
0.4 2.49 


Table 1. Case 1: clear 
and density (p). 




t/nm 


cr/nm p/g cm 2 


c 

TiN 
Si 


0.5 
20.0 


0.1 2.25 
0.5 4.90 
0.4 2.49 



Table 2. Case 2: carbon-contaminated TiN/Si structure. 



20 Step 

range size 

(degrees) (degrees) 



Maximum Background 
intensity 
(counts) (counts) 



Laboratory case 4.0 0.005 

Synchrotron case 7.0 0.005 



10 6 
10 8 



1 
1 



Table 3. Parameters of XRR data quality cases used in simulations. 



24. MCMC analysis 

The MCMC analysis method has initial optimal parameters input using the results of a 
GA. Details of MCMC methods are beyond the scope of this paper. The most important 
point is that all MCMC implementations, if properly tuned and allowed sufficient time, 
should produce the same result. Most research into, and the complications in, MCMC 
methods relate to improving sampler efficiency and thus the number of samples required. 
Thus the details of the particular sampling scheme relate mainly to efficiency; the 
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t/nm cr/nm p/g cm -2 

TiN 15.0 to 25.0 0.01 to 2.5 4.0 to 6.0 

_Si 0.01 to 2.5 2.0 to 3.0 

Table 4. Model 1: Allowed MCMC (uniform prior) ranges for TiN/Si structure with 
no surface contamination layer. 





t/nm 


cr/nm 


p/g cm" 2 


c 


0.0 to 2.0 


0.01 to 2.5 


2.0 to 3.0 


TiN 


15.0 to 25.0 


0.01 to 2.5 


4.0 to 6.0 


Si 


- 


0.01 to 2.5 


2.0 to 3.0 



Table 5. Model 2: Allowed MCMC (uniform prior) ranges for TiN/Si multilayer with 
surface contamination layer. 





t/nm 


cr/nm 


p/g cm 2 


c 


0.5 


0.1 


2.25 


TiN 


15.0 to 25.0 


0.01 to 2.5 


4.0 to 6.0 


Si 


- 


0.01 to 2.5 


2.0 to 3.0 



Table 6. Model 2a: Allowed MCMC range for TiN/Si multilayer with a known surface 
contamination layer. 



resulting samples are from the same Bayesian posterior probability distribution. 

It is important for interpreting the probability distributions sampled by MCMC 
to know three modeling assumptions: (1) the allowed prior ranges for each parameter 
within a model, (2) the assumed prior distributions for each parameter, and (3) the type 
of noise assumed within the data. In this work, we use ranges for a uniform prior which 
assume the same physical structure (same number of layers) as the simulated data; the 
ranges of the priors are generous to allow the MCMC to sample a wide parameter space. 
In Table [4] we give the allowed parameter ranges for case 1. In Table [5j we provide the 
ranges used in case 2. In Table |6j we use a highly constrained model for case 2 with 
all the values of the carbon contamination layer fixed at their simulated values; i.e., we 
assume we know the nature of the contamination exactly. In each model, we assume a 
uniform prior distribution for thickness, roughness, and density. In all cases we assume 
the noise in actual measured data - and thus the likelihood of the data for a given set 
of parameter values - is Poisson. 

3. Results 

Statistically-determined uncertainties for laboratory and synchrotron levels of data 
quality have been calculated using the MCMC method for a simulated clean TiN 
sample (case 1) with corresponding modeling ranges (model 1) and a simulated carbon 
contaminated TiN sample (case 2) with its corresponding modeling ranges (model 2). 
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Absolute and relative uncertainties for each structural parameter and each data quality 
are presented. We also present a modified analysis for case 2, in which we provide 
the exact parameters for the contamination layer as prior information for the MCMC 
method (model 2a) and discuss the resulting uncertainties. 

3.1. Clean sample - case 1 

The power of the Bayesian analysis via MCMC is through its generation of posterior 
probability distributions for each parameter within a physical model, clearly showing 
the uncertainty ranges (for example, see Figure [3]). The expanded uncertainties can be 
directly calculated by finding the parameter bounds for the probability distribution plot 
area representing the 95 % highest probability. For case 1, these expanded uncertainty 
ranges are tabulated in Table [7) 

For a clean, single-layer structure, there is a clear (factor of two) advantage to 
synchrotron measurements with regards to determination of accurate thickness and 
roughness information. This statistical determination method for uncertainty estimation 
is absent from optimization refinement methods such as GAs. Studying 2-dimensional (2 
simultaneous parameters) posterior probability distributions allows us to qualitatively 
and quantitatively explore parameter correlations. The clear improvements in TiN 
thickness and roughness seen in Table [7] are due to the orthogonal (no correlation) 
nature of thickness and roughness (see Figure Q. As a general rule, when no correlation 
exists between parameters within a refinement, then better data quality will directly 
correspond to reduced uncertainties for the parameters in question. However, if 
correlations do exist between two or more parameters, as for example between film 
roughness and film density (see Figure [5]) in case 1, then correlations will introduce 
intrinsic parameter uncertainties which cannot be further reduced with higher data 
quality. 

As seen by studying the ratios of uncertainty estimates (last column in Table [7]) , 
when determining the density of either the TiN or the Si substrate, there is no clear 
advantage between the laboratory and the synchrotron levels of data quality. The 
relative quality of density determination is nearly identical in both cases (uncertainty 
ratios equal to 1). This constant nature of density uncertainty over both levels of 
data quality is likely caused by two factors: First, that both datasets have the same 
spacing between collection points, so that the critical angle is not much more precisely 
determined by synchrotron data than the laboratory data. Second, density correlates 
with other modeling parameters, for example, interface roughness (see Figure [5]). 

3.2. Carbon contaminated film - case 2 

For case 2, we present the expanded uncertainty ranges in Table [8] and see several 
surprising results. When one introduces a carbon contamination layer, the advantages 
in reduced uncertainties from using the higher data quality of a synchrotron vanishes for 
all but roughness determination. For ratios of unity or near unity, (e.g., 0.93, 0.75) there 
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Parameter 


u 


U 


[U(lab) / 




(synchrotron) 


(laboratory) 


U(sync)] 


^TiN 


0.048 nm 


0.11 nm 


2.3 


C"TiN 


0.033 nm 


0.060 nm 


1.8 


OSi 


0.050 nm 


0.139 nm 


2.8 


pTiN 


1.08 g/cm 2 


1.05 g/cm 2 


1 


PSi 


0.90 g/cm 2 


0.90 g/cm 2 


1 



Table 7. MCMC-determined expanded uncertainties (95% probability intervals) 
for the model parameters of case 1, model 1, and the ratio of these uncertainties, 

[U(lab)/U(sync)] 




Figure 3. TiN thickness (ti) posterior probability density for case 1 using synchrotron 
quality data. 




0.32 0.34 0.36 0.38 0.40 0.42 0.44 0.46 
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Figure 4. 2-dimensional histogram showing no correlation between TiN thickness 
(ti) and TiN interface roughness (<ti) for case 1. Intensity scale of histogram shows 
the relative frequency with which the Monte Carlo Markov Chain explores a given 
parameter space. (High frequencies directly correspond to high probabilities for a 
well-tuned MCMC.) 
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Figure 5. 2-dimensional histogram showing correlation between TiN density (p\) and 
TiN interface roughness (<j\) for case 1. Intensity scale of histogram shows the relative 
frequency with which the Monte Carlo Markov Chain explores a given parameter space. 



is no clear advantage to synchrotron data. [Apparent disadvantages to synchrotron data 
(i.e., ratios less than one) are artifacts to the coarseness of our sampling analysis.] This is 
partially a consequence of high inverse correlation between contamination layer thickness 
and TiN thickness (see Figure [6]) . Correlations also exist between the contamination 
density and surface roughness (see Figure [7|, further expanding uncertainties throughout 
the model parameters. 

When one examines only the highest quality data (synchrotron) , the effect of clean 
vs. contaminated surfaces can be directly compared. In Table |9| we see that only 
the TiN thickness and roughness show pronounced reductions in uncertainty ranges 
from a contaminated vs. clean structure. An astute observer may wonder why the 
uncertainty for Si and TiN density are not improved through the removal of the carbon 
contamination layer. This is because XRR, in some cases, is simply not sensitive to a 
given model parameter. This sensitivity issue can be distinguished from a correlation 
phenomena, again by using the MCMC posterior probability densities or 2-dimensional 
histograms, and looking for parameters which produce uniform posteriors out the 
analysis. In Figure |HJ we see that the Si density probability density is nearly uniform 
over the allowed range of the parameter. This lack of a pronounced peak demonstrates 
very little sensitivity to Si density in our data. 



3. 3. Contamination of known thickness, roughness, and density - case 2a 



In Table 10 , we introduce the results from our known parameter carbon contamination 



case. We compare the uncertainties between clean, unknown contamination, and exactly 
known contamination cases The most interesting feature is the TiN thickness. Even for 
the case where the contamination thickness, roughness, and density are known a priori, 
the model still has 5 times higher TiN thickness uncertainty over the clean surface case. 
There is a reduction of a factor of 2 over the unknown contamination case; this reduction 
indicates that not knowing the exact properties of the contamination layer does have an 
effect on uncertainties. (Or, identically, that refining an unknown contamination layer 
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Figure 6. 2-dimensional histogram showing inverse correlation between contamination 
layer thickness (£]_) and TiN thickness (£2) for for case 2. Intensity scale of histogram 
shows the relative frequency with which the Monte Carlo Markov Chain explores a 
given parameter space. 




0.10 
o-o/nm 



Figure 7. 2-dimensional histogram showing correlation between contamination layer 
density (pi) and surface roughness (<Jo) for case 2. Intensity scale of histogram shows 
the relative frequency with which the Monte Carlo Markov Chain explores a given 
parameter space. 




0.85 0.90 0.95 1.00 1.05 1.10 1.15 



Figure 8. Posterior probability density for Si substrate density (^3) showing lack of 
sensitivity for this parameter within the XRR model for case 2. 
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increases the uncertainties.) But this effect is much smaller than the effect of the mere 
presence of the contamination layer. This is because the presence of the contamination 
layer causes a decrease in the contrast of the TiN layer thickness fringes, decreasing 
the ability of higher quality data to provide more information. This manifests itself in 
correlations in the model which can substantially increase the overall uncertainties for 
the TiN layer thickness. 

The very presence of contamination on the structure - rather than the need to fit 
the contamination - has the largest effect on the uncertainty. 

4. Conclusions 

There are some caveats to this conclusion: Only simulated data has been considered. All 
systematic instrumental errors have been neglected. No instrument response functions 
have been modeled. The comparison between laboratory and synchrotron data is made 
only for the case of a Cu Ka laboratory source radiation and an synchrotron beamline 
set to the same energy - so any advantage to tuning the energy of the beam for specific 
materials and structures has been neglected. (For an example of XRR fit improvement 
through judicious source energy selection using a synchrotron, see [H]). 

But accepting these limitations, save perhaps source energy tuning, as not being 
likely to improve data quality, the MCMC XRR analysis technique provides a powerful 
tool for studying the theoretical limitations of XRR measurements of a structure before 
taking measurements. 

In this case of a carbon contamination layer, we see that the theoretical 
uncertainty estimates for parameters are dominated by correlations between the surface 
contamination thickness and the TiN thickness increasing the uncertainty estimates for 
our thin film of interest whenever the carbon contamination layer is present. Even 
when armed with prior knowledge of all parameters for the contamination layer, we see 
a five-fold increase in the TiN thickness uncertainty caused by the introduction of the 
contamination layer. Higher data quality will provide significant reductions in parameter 
uncertainties for simple XRR models, such as the clean TiN thin film. However, in the 
presence of contamination, we see minimal gain through enhanced data quality for the 
determination of thin film thickness. 

This MCMC simulated data study has shown that removing the contamination 
is essential to significantly reducing the uncertainties in the high-Z layer thickness 
measurement. 
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Parameter 


u 


U 


[U(lab) / 




(synchrotron) 


(laboratory) 


U(sync)] 


tc 


0.64 nm 


0.66 nm 


1.0 


^TiN 


0.67 nm 


0.62 nm 


0.93 


^C + ^TiN 


0.32 nm 


0.24 nm 


0.75 


(7C 


0.033 nm 


0.16 nm 


4.8 


0"TiN 


0.50 nm 


0.48 nm 


0.96 


^Si 


0.027 nm 


0.20 nm 


7.4 


PC 


0.61 g/cm 2 


0.92 g/cm 2 


1.5 


PTiN 


1.49 g/cm 2 


1.28 g/cm 2 


0.86 


PSi 


0.94 g/cm 2 


0.94 g/cm 2 


1 



Table 8. MCMC-determined expanded uncertainties (95% probability intervals) 
for the model parameters of case 2, model 2, and the ratio of these uncertainties, 
[U(lab)/U(sync)]. 



Parameter 


U( clean) 
(synchrotron) 


U(with carbon) 
(synchrotron) 


[U (carbon) / 
U (clean)] 


^TiN 


0.048 nm 


0.67 nm 


14 


^surface 


0.033 nm 


0.033 nm 


1 


0"TiN 


0.033 nm 


0.5 nm 


15 


^Si 


0.050 nm 


0.027 nm 


0.54 


PTiN 
PSi 


1.49 g/cm 2 
0.94 g/cm 2 


1.49 g/cm 2 
0.94 g/cm 2 


1 
1 



Table 9. Comparison of MCMC-determined expanded uncertainties (95% probability 
intervals) between clean vs. contaminated cases for synchrotron quality data 



Parameter 


U (clean) 


U (carbon) 


U (known 
carbon) 


^TiN 


0.048 nm 


0.67 nm 


0.31 nm 


^"surface 


0.033 nm 


0.033 nm 


fixed 


0"TiN 


same as cr sur f ace 


0.5 nm 


0.36 nm 


^Si 


0.050 nm 


0.027 nm 


0.027 nm 


PTiN 


1.49 g/cm 2 


1.49 g/cm 2 


1.49 g/cm 2 


PSi 


0.94 g/cm 2 


0.94 g/cm 2 


0.93 g/cm 2 



Table 10. Comparison of MCMC-determined expanded uncertainties (95% 
probability intervals) between clean, unknown, and known contamination layer cases 
for synchrotron quality data 
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on thin film characterization and for providing numerous interesting sets of XRR data 
for the development of analysis techniques. 
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